T-tilt: a modified tilt model for F0 analysis and synthesis in tonal languages

نویسندگان

  • Ausdang Thangthai
  • Nattanun Thatphithakkul
  • Chai Wutiwiwatchai
  • Anocha Rugchatjaroen
  • Sittipong Saychum
چکیده

This paper proposes a modified Tilt model, called T-Tilt, for analyzing and synthesizing F0 contours in tonal languages. The Tilt model successfully designed for intonation modeling is extended to cover syllable-based F0 realization influenced strongly by the tonal context. Two modification approaches include adding a parameter indicating a F0 curve pattern and separating duration and amplitude controls inherent in the Tilt parameter for sake of flexibility. Evaluations are conducted by both an objective RMSE measure and a subjective MOS test on intelligibility and naturalness aspects. Applying to Thai and Mandarin Chinese continuous speech, the proposed model is proved to be very effective for F0 contour analysis. It rather requires extensive work on parameter synthesis although the synthesizing performance is comparable to those produced by other proposed models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The tilt intonation model

The tilt intonation model facilitates automatic analysis and synthesis of intonation. The analysis algorithm detects intonational events in F0 contours and parameterises them in terms of the continuously varying Tilt parameters. We describe the analysis system and give results for speaker independent spontaneous dialogue speech. We then describe a synthesis algorithm which can generate F0 conto...

متن کامل

Analysis and synthesis of intonation using the Tilt model.

This paper introduces the Tilt intonational model and describes how this model can be used to automatically analyze and synthesize intonation. In the model, intonation is represented as a linear sequence of events, which can be pitch accents or boundary tones. Each event is characterized by continuous parameters representing amplitude, duration, and tilt (a measure of the shape of the event). T...

متن کامل

Generation of fundamental frequency contours for Thai speech synthesis using tone nucleus model

As classic and intrinsic requirements, synthetic speech need to convey correct information with good quality of naturalness to listeners. Fundamental frequency (F0) contours need to be controlled to meet these requirements. Additional challenges have been introduced to tonal languages because the F0 contour reflects both intelligibility and naturalness of the speech. According to the fact that ...

متن کامل

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

An Overview of Prosodic Modelling for Croatian Speech Synthesis

In order to include prosody into the text to speech (TTS) systems prosody knowledge needs to be acquired, represented and incorporated. Two main features of prosody important for modelling prosody for TTS systems are duration and F0 contour. There are various approaches to modelling those features and they can be categorized into three main groups: rule based, statistical and minimalistic. Some...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008